Search CORE

68 research outputs found

Learning the Visual Dynamics of Human Body Motions

Author: Ong Eng-Jon
Publication venue
Publication date: 30/12/2013
Field of study

A Thesis submitted to the University of London for the degree of Doctor of Philosoph

Queen Mary Research Online

Deep Architectures and Ensembles for Semantic Video Classification

Author: Bober Miroslaw
Bober-Irizar Mikel
Husain Sameed
Ong Eng-Jon
Publication venue
Publication date: 01/01/2018
Field of study

This work addresses the problem of accurate semantic labelling of short videos. To this end, a multitude of different deep nets, ranging from traditional recurrent neural networks (LSTM, GRU), temporal agnostic networks (FV,VLAD,BoW), fully connected neural networks mid-stage AV fusion and others. Additionally, we also propose a residual architecture-based DNN for video classification, with state-of-the art classification performance at significantly reduced complexity. Furthermore, we propose four new approaches to diversity-driven multi-net ensembling, one based on fast correlation measure and three incorporating a DNN-based combiner. We show that significant performance gains can be achieved by ensembling diverse nets and we investigate factors contributing to high diversity. Based on the extensive YouTube8M dataset, we provide an in-depth evaluation and analysis of their behaviour. We show that the performance of the ensemble is state-of-the-art achieving the highest accuracy on the YouTube-8M Kaggle test data. The performance of the ensemble of classifiers was also evaluated on the HMDB51 and UCF101 datasets, and show that the resulting method achieves comparable accuracy with state-of-the-art methods using similar input features

arXiv.org e-Print Archive

University of Surrey

Surrey Research Insight

MIMiC: Multimodal Interactive Motion Controller

Author: Dumebi Okwechime
Eng-Jon Ong
Richard Bowden
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

Robust Facial Feature Tracking Using Shape-Constrained Multiresolution-Selected Linear Predictors

Author: Eng-Jon Ong
Richard Bowden
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

Single-cell Subcellular Protein Localisation Using Novel Ensembles of Diverse Deep Architectures

Author: Bober Miroslaw
Bober-Irizar Mikel
Husain Syed Sameed
Irizar Amaia
Minskiy Dmitry
Ong Eng-Jon
Publication venue
Publication date: 16/09/2022
Field of study

Unravelling protein distributions within individual cells is key to understanding their function and state and indispensable to developing new treatments. Here we present the Hybrid subCellular Protein Localiser (HCPL), which learns from weakly labelled data to robustly localise single-cell subcellular protein patterns. It comprises innovative DNN architectures exploiting wavelet filters and learnt parametric activations that successfully tackle drastic cell variability. HCPL features correlation-based ensembling of novel architectures that boosts performance and aids generalisation. Large-scale data annotation is made feasible by our "AI-trains-AI" approach, which determines the visual integrity of cells and emphasises reliable labels for efficient training. In the Human Protein Atlas context, we demonstrate that HCPL defines state-of-the-art in the single-cell classification of protein localisation patterns. To better understand the inner workings of HCPL and assess its biological relevance, we analyse the contributions of each system component and dissect the emergent features from which the localisation predictions are derived

arXiv.org e-Print Archive

Directory of Open Access Journals

Applications of Face Analysis and Modeling in Media Production

Author: Cosker Darren
Eisert Peter
Grau Oliver
Hancock Peter J B
McKinnell Jonathan
Ong Eng-Jon
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 20/12/2012
Field of study

Facial expressions play an important role in day-by-day communication as well as media production. This article surveys automatic facial analysis and modeling methods using computer vision techniques and their applications for media production. The authors give a brief overview of the psychology of face perception and then describe some of the applications of computer vision and pattern recognition applied to face recognition in media production. This article also covers the automatic generation of face models, which are used in movie and TV productions for special effects in order to manipulate people's faces or combine real actors with computer graphics

Stirling Online Research Repository (RIOXX)

Stirling Online Research Repository

Applications of face analysis and modelling in media production:Overview of the state of the art

Author: Cosker Darren
Eisert Peter
Grau Oliver
Hancock Peter
McKinnel Joe
Ong Eng Jon
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/10/2013
Field of study

OPUS

Fraunhofer-ePrints